LP decoding of expander codes: a simpler proof 
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Abstract 

A code C C is a (c, e, <5)-expander code if it has a Tanner graph, where every variable node has 
degree c, and every subset of variable nodes Lq such that \Lq\ < 5n has at least ec|Lo| neighbors. 

Feldman et al. (IEEE IT, 2007) proved that LP decoding corrects ||5f ■ (Sn — 1) errors of (c, e, 5)- 
expander code, where e > | + 

In this paper, we provide a simpler proof of their result and show that this result holds for every 
expansion parameter e > | . 



^ ! 1 Introduction 

The central algorithmic problem in coding theory is the explicit construction of error-correcting codes with 
best possible parameters together with fast encoding and decoding algorithms. Recently, this area has ben- 
efited enormously from insights and viewpoints originating in complexity theory, and numerous intercon- 



o 

. nections between codes and complexity theory have been discovered, which are surveyed for example in 

[9, 12]. The former survey [9] focuses on a notion called list-decoding and the second survey [12] focuses 
mainly on sub-linear algorithms for local testing and local decoding. Basically, there are two different kinds 
of noise models: adversarial and probabilistic. In this paper we consider an adversarial noise model where 
we only assume a bound on the number of errors and not how they are distributed. We refer a reader to the 



seminal paper of Richardson and Urbanke [7] for details concerning probabilistic noise model. 

Surprisingly, all known constructions of asymptotically good error-correcting codes which can be de- 
coded in linear time when a constant fraction of symbols is adversarially corrupted are based on expander 
codes. In particular, when the parity check graph of the low density parity check (LDPC) code has good 
expansion properties the associated code is called an expander code (see Definition 2.1). Expander codes 
and their decoding algorithms are often (implicitly) involved as basic building blocks in the constructions of 
asymptotically good codes which are linear-time decodable. 

More formally, a linear code C C is a (c, e, 5)-expander code if it has a Tanner graph [11] G = 
(L, R, E), where every variable node I £ L has degree c and every subset of variable nodes Lq C L such 
that |Lq| < 5n has at least ec|Lo| neighbors. 
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The celebrated result of Sipser and Spielman [8] showed that regular expander codes with expansion 
parameter e > 3/4 are decodable in linear time from (2e — 1) [5n\ errors. They achieved this by defin- 
ing an extremely simple decoding algorithm, called Flip Algorithm. Feldman et al. [3] showed that the 
linear programming (LP) decoding (suggested in [4]) corrects ■ |_^ n J errors in polynomial time if the 
underlying code is a (c, e, <5)-expander with expansion parameter e > | + ^. It is worth to notice that the 
running time of the LP decoder was strongly improved later (see e.g. [2, 10, 14]), although linear running 
time was not achieved. Then it was shown that similar result can be obtained using a different decoding 
algorithm [13], which in particular runs in linear time. Moreover, [ 13, Proposition D. 1] points out that some 
regular expander codes with expansion parameter 1/2 have minimum Hamming distance 2 and thus cannot 
correct even a single error. This leaves open the intriguing question: whether all regular expander codes with 
expansion parameter more than 1/2 are polynomial time decodable from the constant fraction of errors. 

While the decoding algorithm provided in [13] reaches better (proven) result than LP decoding in [3] 
with regards to regular expander codes, the LP decoder still looks much more powerful. Hence we might 
hope that the future of studying of LP decoding capabilities (see [1, 15, 16, 6, 5]) will resolve the above 
question. 

The proof of the main result of Feldman et al. [3] was indirect and took the LP dual witness twice 
by using the min-cut max-flow theorem. We show a straightforward proof for the result of [3] which in 
particular holds for every expansion parameter e > 2/3. We think that our simplified proof sheds more light 
on the intuition lying behind the LP decoding of expander codes. 

2 Preliminaries 

Let F2 be the binaiy field and [n] be the set {1, . . . , n}. In this work, we consider only linear codes. 
Let C C F n be a linear code over a field F. For w G ¥ n , let supp(w) = {i G [n] \ W{ 7^ 0} and 
\w\ = I supp(u>)|. We define the distance between two words x, y G ¥ n to be A(x, y) = \{i \ Xi ^ y{\\. 
The minimum distance of a code is defined by A(C) = min A(x,y). The dual code is defined by 

x^yeC 

C 1 - = {u G F n I Vc G C : (u, c) = 0}. For T C F n we say that w _L T if for all t G T we have (w, t) = 0. 

Now we define expanders and expander codes. We start from the definition of expanders and then 
proceed to the definition of expander codes. 

Definition 2.1 (Expander Codes). Let C C F?j be a linear code and let G = (L, R, E) be its Tanner graph 
[11], where L = [n] represents the variable nodes and R C C 1 - represents the parity check nodes. Note that 
for every x G F£ we have x _L R if and only if x G C. For I G L and r G R it holds that {I, r} G E if and 
only if I G supp(r). For T C L U R and x G L U R, let 

• N(T) = {xi G L U R I {xi,x 2 } G E for some x 2 G T} be the set of neighbors of T, 

• N(x) = N({x}) be the set of neighbors of the node x. 

Let e, 5 > be constants. Then, G is called a (c, e, 5)-expander if every vertex I G L has degree c and 
for all subsets S C L such that \S\ < 5n we have |iV(5)| > e • c\S\. 

We say that a code C is a (c, e, 5)-expander code if it has a parity check graph that is a (c, e, <5)-expander. 

3 Main Result 

Feldman et al. [3] proved the following theorem. 
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Theorem 3.1 ([3]). If C C IFj is a (c, e, S)-expander code, where e > | + ^ awcf ec is an integer then C is 
decodable from at most • ([Sn\ — 1) errors (in polynomial time) by LP decoding. 

Their proof took the LP dual witness twice and in particular, they applied min-cut max-flow theorem to 
find such a witness. Feldman et al. [3] mentioned that it would be interesting to see a more direct proof of 
this result. 

We reprove their result in the more straightforward manner and obtain a small improvement to the 
required expansion parameter, i.e., we assume only that e > 2/3. 

Theorem 3.2 (Main Theorem). If C C FJJ is a (c, e, S)-expander code, where e > | and ec is an integer 
then C is decodable from at most ||5t • ( \&n\ — 1) errors (in polynomial time) by LP decoding. 

In Section 4 we recall an implicit result (Theorem 4.2) that was shown in [3]. We shall use Theorem 4.2 
in the proof of Theorem 3.2, which appears in Section 5. 

4 Implicit result of Feldman et al. [3] 

Let C C F2 be a code and G = (L = [n], R, E) be its parity check graph. Assume that w € F2 is an input 
word (a corrupted codeword) and U C [n] is a set of error coordinates, i.e., if all bits in w\u were flipped 
then w would be the codeword of C. We say that 7i = — 1 if z G £/ and 73 = +1 if i £ U. 

Definition 4.1 (Feasible weights). Let G = (L, R, E) be the parity check graph of the code C C F 2 ' . A 
setting of edge weights {t^j}^ is called feasible with respect to U if 

1. For all j G R and distinct i, i' £ A r (j) we have r^j + Tj/j > 0. 

2. For alH 6 L we have J2jeN(i) T i,j < 7i- 

The following theorem was implicit in [3] and was argued in [3, Proposition 2] and by the discussions 
in [3, Sections 2 and 3]. 

Theorem 4.2 (Implicit in [3]). If for all U C [n], |J7| < an there exists a feasible settings of weights 
{ T hj}(i j)eE t ^ ien ^ decoding for the code C corrects an errors. Moreover, ifC is an LDPC code then LP 
decoding runs in polynomial time. 

The nice point here is that one can use Theorem 4.2 without any background on the LP decoding tech- 
nique. We also note that this Theorem is related to the LDPC codes in general, not necessarily to the 
expander codes. 

5 Proof of Theorem 3.2 

We start the proof by defining the concept of matching (slightly different and simpler than in [3]). 

Definition 5.1 ((/-Matching). Let M C E be a subset and let q be an integer. With some abuse of notations 
we say that M is a matching if for every j 6 R we have at most one i € [n] such that {i, j} € M. Given a 
subset U C [n] we say that M is a g-matching with respect to U if 

1 . M is a matching. 
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2. For every u G U we have at least q nodes j € R such that {u, j} G M. 
For the rest of the proof, let U C [n] be the set of error locations in the input word, and 

U = {i G [n] \ £/ | |iV(i) n iV(C/)| > (2e - l)c}. 

We let U' = U U U. The following proposition is similar to the corresponding proposition in [3], where an 
appropriate modification was made to fit our definition of matching (Definition 5.1). 

Proposition 5.2. Let U' C [n] and assume e > 2/3. If there exists an (ec) -matching M with respect to U' 
then there is a feasible edge weight assignment (as in Definition 4.1). 

Proof. We assign weights as follows. 

• For j G R such that there is an i G U and (i, j) G M, set Tjj = —x, and for all other %' G iV(j) \ {i} 
set Tj/j = +x 

• For all other j G R, for all i G N(j) set Tjj = 0. 

Clearly, this weighting satisfies the first condition of a feasible weight assignment. For condition 2 we 
distinguish three cases. We recall that ec is an integer. 

1. For i G U we have 7$ = —1. Also, at least ec edges that are incident to i are in M (and each has 
weight —x). All other incident edges have weight or +x. So, the total weight of the incident edges 
to i is at most ec(— x) + (1 — e)c(x) = cx(l — 2e). This is less than —1 as long as x > ( 2e -i)c ' 

2. If i G 17 then 7, = +1. At least ec of i's incident edges ai - e in M, but (trivially) not incident to U. 
These edges have weight 0. The other incident edges have weight either or +x. So, the total weight 
is at most (1 — e)cx which is smaller than +1 as long as x < riZ e \ c - 

3. When i ^ U U U then ^ = +1. We know that i has less than (2e — l)c neighbors in N(U), i.e., at 
most (2e — l)c — 1 neighbors in N(U). All other edges have weight 0. I.e., the total weight is at most 
((2e — l)c — l)(x) which is smaller than +1 as long as x < / 2e -i)c-l ' 

Summarizing our conditions on x, we have ( 2e -i)c < x < m ™ { (l-e)c ' (2e-i)c-i }• There is such an x 
if 2e - 1 > 1 - e, i.e., e > 2/3. □ 

It is not hard to see that a sufficiently strong upper bound on \U\ implies an upper bound on \U'\. 

Proposition 5.3. Assume that \U\ < ||5f ■ L^ n J- Then \U'\ = \U\ + \U\ < Sn. 

Proof. Recall that U = {i G [n] \ U \ \N(i) n N(U)\ > (2e - l)c}. 

We prove that \U U U\ < [5n\ . Assume the contrary and let U" C U such that \U UU"\ = [Sn\ (note 
that \U\< [Sn\). 

We argue that \U"\ < £=%\U\ which implies that \U U U"\ = \U\ + \U"\ < %Ez\U\ < [Sn\ with 
contradiction. Due to the fact that C is a (c, d, e, <5)-expander code we have 

ec(\U\ + \U"\) < \N{UUU")\ = \N(U)\ + \N(U") \ N(U)\ <\U\-c+ \U"\ ■ (c - (2e - l)c). 

Then we have |J7"|(3e - 2)c < |J7|(1 - e)c and \U"\ < ^\U\. Contradiction. □ 
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We are ready to prove Theorem 3.2. 



Proof of Theorem 3.2. Let U C [n] be a set of error locations. By assumption, \U\ < ' L^ n J an ^ 
so, by Proposition 5.3 we have \U'\ = \U\ + \U\ < 5n. By Corollary 5.5 (stated below) there exists an 
(ec)-matching M with respect to U'. By Proposition 5.2 there exists a feasible edge weight assignment. 
Theorem 4.2 implies that the LP decoder succeeds and runs in polynomial time. □ 

5.1 Hall's Theorem 

Let us recall that G = (L, R, E) is the bipartite graph with vertex sets L and R, and the edge set E. We also 
recall Definition 5.1. We let X C L to be a subset of L. 

Theorem 5.4 (Hall's Marriage Theorem). Assume that for every S C X we have \N(S)\ > \S\. Then there 
exists a 1-matching with respect to X. 

Corollary 5.5 (Polygamous form of Hall's Theorem). Let d > 1 be an integer. Assume that for all S C X 
we have \N(S)\ > d\S\. Then there exists d-matching with respect to X. 

Proof. Replace each x € X with d nodes connected to all nodes in N(x). The corollary follows from the 
Hall's theorem (Theorem 5.4). □ 
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